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OIPE 



RAW SEQUENCE LISTING DATE: 02/11/2002 

PATENT APPLICATION: US/09/866 , 37 9 TIME: 09:07:06 



Input Set : A:\DIVER1370-7.ST25.txt 
Output Set: N:\CRF3\02112002\I866379.raw 



IT- 



ENTERED 



2 <110> APPLICANT: DIVERSA CORPORATION 

3 SHORT, Jay 

4 KRETZ , Keith 

5 GRAY, Kevin 

6 BARTON, Nelson 

7 GARRETT , James 

8 O'DONOGHUE, Eileen 

10 <120> TITLE OF INVENTION: RECOMBINANT BACTERIAL PHYTASES AND USES THEREOF 
12 <130> FILE REFERENCE: DIVER1370-7 

14 <140> CURRENT APPLICATION NUMBER: US 09/866,379 

15 <141> CURRENT FILING DATE: 2001-05-24 

17 <150> PRIOR APPLICATION NUMBER: US 09/580,515 

18 <151> PRIOR FILING DATE: 2000-05-25 

20 <150> PRIOR APPLICATION NUMBER: US 09/318,528 

21 <151> PRIOR FILING DATE: 1999-05-25 

23 <150> PRIOR APPLICATION NUMBER: US 09/291,931 

24 <151> PRIOR FILING DATE: 1999*04-13 

26 <150> PRIOR APPLICATION NUMBER: US 09/259,214 

27 <151> PRIOR FILING DATE: 1999-03-01 

29 <150> PRIOR APPLICATION NUMBER: US 08/910,798 

30 <151> PRIOR FILING DATE: 1997-08-13 
32 <160> NUMBER OF SEQ ID NOS : 10 

34 <170> SOFTWARE: Patentln version 3.1 

36 <210> SEQ ID NO: 1 

37 <211> LENGTH: 1323 

38 <212> TYPE: DNA 

39 <213> ORGANISM: Escherichia coli 

41 <220> FEATURE: 

42 <221> NAME/KEY: misc_f eature 

43 <222> LOCATION: (1)..(1323) 

44 <223> OTHER INFORMATION: n is any nucleotide 
46 <220> FEATURE : 

4 7 <221> NAME/KEY: CDS 

48 <222> LOCATION: (1)..(1323) 

49 <223> OTHER INFORMATION: 
51 <400> SEQUENCE: 1 

48 



96 



52 


atg 


aaa 


gcg 


ate 


tta 


ate 


cca 


ttt 


tta 


tct 


ctt 


ctg 


att 


ccg 


tta 


acc 


53 


Met 


Lys 


Ala 


He 


Leu 


He 


Pro 


Phe 


Leu 


Ser 


Leu 


Leu 


He 


Pro 


Leu 


Thr 


54 


1 








5 










10 










15 




56 


ccg 


caa 


tct 


gca 


ttc 


get 


cag 


agt 


gag 


ccg 


gag 


ctg 


aag 


ctg 


gaa 


agt 


57 


Pro 


Gin 


Ser 


Ala 


Phe 


Ala 


Gin 


Ser 


Glu 


Pro 


Glu 


Leu 


Lys 


Leu 


Glu 


Ser 


58 








20 










25 








30 






60 


gtg 


gtg 


att 


gtc 


agt 


cgt 


cat 


ggt 


gtg 


cgt 


get 


cca 


acc 


aag 


gec 


acg 



144 
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RAW SEQUENCE LISTING LAT!\ 'J 2/1 1/2002 

PATTNT APPLT™TTON: US/09/866, 379 TTMF: 09:07:06 



336 



Input Set : A:\DIVER1370-7.ST25.txt 
Output Set: N:\CRF3\02112002\I866379.raw 

61 Val Val He Val Ser Arg His Gly Val Arg Ala Pro Thr Lys Ala Thr 

62 35 40 45 

64 caa ctg atg nag gat gtc acc cca gac gca tgg cca acc tgg ccg gta 192 

65 Gin Leu Met Gin Asp Val Thr Pro Asp Ala Trp Pro Thr Trp Pro Val 

66 50 55 60 

W --> 68 aaa ctg ggt tgg ctg aca ccg cgn ggt ggt gag eta ate gec tat etc 24 0 

69 Lys Leu Gly Trp Leu Thr Pro Arg Gly Gly Glu Leu He Ala Tyr Leu 

70 65 70 75 80 

72 gga cat tac caa cgc cag cgt ctg gta gec gac gga ttg ctg gcg aaa 288 

73 Gly His Tyr Gin Arg Gin Arg Leu Val Ala Asp Gly Leu Leu Ala Lys 

74 85 90 95 

76 aag ggc tgc ccg cag tct ggt cag gtc gcg att att get gat gtc gac 

77 Lys Gly Cys Pro Gin Ser Gly Gin Val Ala He He Ala Asp Val Asp 

78 100 105 HO 

80 gag cgt acc cgt aaa aca ggc gaa gec ttc gec gec ggg ctg gca cct 384 

81 Glu Arg Thr Arg Lys Thr Gly Glu Ala Phe Ala Ala Gly Leu Ala Pro 

82 115 120 125 

84 gac tgt gca ata acc gta cat acc cag gca gat acg tec agt ccc gat 432 

85 Asp Cys Ala He Thr Val His Thr Gin Ala Asp Thr Ser Ser Pro Asp 

86 130 135 140 

88 ccg tta ttt aat cct eta aaa act ggc gtt tgc caa ctg gat aac gcg 480 

89 Pro Leu Phe Asn Pro Leu Lys Thr Gly Val Cys Gin Leu Asp Asn Ala 

90 145 150 155 160 

92 aac gtg act gac gcg ate etc age agg gca gga ggg tea att get gac 528 

93 Asn Val Thr Asp Ala He Leu Ser Arg Ala Gly Gly Ser He Ala Asp 

94 165 170 175 

96 ttt acc ggg cat egg caa acg gcg ttt cgc gaa ctg gaa egg gtg ctt 576 

97 Phe Thr Gly His Arg Gin Thr Ala Phe Arg Glu Leu Glu Arg Val Leu 

98 180 185 190 

100 aat ttt ccg caa tea aac ttg tgc ctt aaa cgt gag aaa cag gac gaa 624 

101 Asn Phe Pro Gin Ser Asn Leu Cys Leu Lys Arg Glu Lys Gin Asp Glu 

102 195 200 205 

104 age tgt tea tta acg cag gca tta cca teg gaa etc aag gtg age gec 672 

105 Ser Cys Ser Leu Thr Gin Ala Leu Pro Ser Glu Leu Lys Val Ser Ala 

106 210 215 220 

108 gac aat gtc tea tta acc ggt gcg gta age etc gca tea atg ctg acg 720 

109 Asp Asn Val Ser Leu Thr Gly Ala Val Ser Leu Ala Ser Met Leu Thr 

110 225 230 235 240 

112 gag ata ttt etc ctg caa caa gca cag gga atg ccg gag ccg ggg tgg 

113 Glu He Phe Leu Leu Gin Gin Ala Gin Gly Met Pro Glu Pro Gly Trp 

114 245 250 255 

116 gga agg ate acc gat tea cac cag tgg aac acc ttg eta agt ttg cat 

117 Gly Arg He Thr Asp Ser His Gin Trp Asn Thr Leu Leu Ser Leu His 

118 260 265 270 

120 aac gcg caa ttt tat ttg eta caa cgc acg cca gag gtt gee cgc age 864 

121 Asn Ala Gin Phe Tyr Leu Leu Gin Arg Thr Pro Glu Val Ala Arg Ser 

122 275 280 285 

124 cgc gee acc ccg tta ttg gat ttg ate atg gca gcg ttg acg ccc cat 912 

125 Arg Ala Thr Pro Leu Leu Asp Leu He Met Ala Ala Leu Thr Pro His 



768 



816 
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RAW SEQUENCE LISTING 

PATENT APPLICATION : US/0 9/8 6 6 , 3 7 9 

Input Set : A:\DIVER1370-7.ST25.txt 
Output Set: N:\CRF3\02112002\I866379.raw 



DATE: 0 2/1 1/200, 
TIME : 09:07:07 



126 
128 
129 
130 
132 
13 3 
134 
136 
137 
138 
140 
141 
142 
144 
145 
146 
148 
149 
150 
152 
153 
154 
156 
157 
158 
160 
161 
162 
165 
166 

167 

168 

170 

172 

173 

176 

177 

180 

181 

184 

185 

188 

189 

192 

193 

196 

197 

200 

201 



290 295 
cca ccg caa aaa cag gcg tat ggt gtg aca 
Pro Pro Gin Lys Gin Ala Tyr Gly Val Thr 
305 310 

ttt att gcc gga cac gat act aat ctg gca 
Phe He Ala Gly His Asp Thr Asn Leu Ala 
325 330 
gag etc aac tgg acg ctt ccc ggt cag ccg 
Glu Leu Asn Trp Thr Leu Pro Gly Gin Pro 

340 345 
ggt gaa ctg gtg ttt gaa cgc tgg cgt egg 
Gly Glu Leu Val Phe Glu Arg Trp Arg Arg 

355 360 
tgg att cag gtt teg ctg gtc ttc cag act 
Trp He Gin Val Ser Leu Val Phe Gin Thr 

370 375 
aaa acg ccg ctg tea tta aat acg ccg ccc 
Lys Thr Pro Leu Ser Leu Asn Thr Pro Pro 
385 39 0 

ctg gca gga tgt gaa gag cga aat gcg cag 
Leu Ala Gly Cys Glu Glu Arg Asn Ala Gin 
405 410 
ggt ttt acg caa ate gtg aat gaa gca cgc 
Gly Phe Thr Gin He Val Asn Glu Ala Arg 

420 425 
aga tct cat cac cat cac cat cac taa 
Arg Ser His His His His His His 
435 440 
<210> SEQ ID NO: 2 
<211> LENGTH: 440 
<212> TYPE: PRT 
<213> ORGANISM: 
<400> SEQUENCE: 



300 

tta ccc act tea 
Leu Pro Thr Ser 
315 

aat etc ggc ggc 
Asn Leu Gly Gly 



gat aac 
Asp Asn 

eta age 
Leu Ser 

tta cag 
Leu Gin 
380 
gga gag 
Gly Glu 
395 

ggc atg 
Gly Met 



acg ccg 
Thr Pro 
350 
gat aac 
Asp Asn 
365 

cag atg 
Gin Met 

gtg aaa 
Val Lys 

tgt teg 
Cys Ser 



ata ccg gcg tgc 
He Pro Ala Cys 
430 



gta ctg 
Val Leu 
320 
gca ctg 
Ala Leu 
335 

cca ggt 
Pro Gly 

age cag 
Ser Gin 

cgt gat 
Arg Asp 

ctg acc 
Leu Thr 
400 
ttg gca 
Leu Ala 
415 

agt ttg 
Ser Leu 



Escherichia 
2 



coli 

Met Lys Ala He Leu He Pro Phe Leu 



Pro Gin Ser Ala Phe Ala Gin 
20 

Val Val He Val Ser Arg His 
35 

Gin Leu Met Gin Asp Val Thr 
50 



55 

Lys Leu Gly Trp Leu Thr Pro 
65 70 
Gly His Tyr Gin Arg Gin Arg 
85 

Lys Gly Cys Pro Gin Ser Gly 
100 

Glu Arg Thr Arg Lys Thr Gly 
115 



Ser Glu 

25 
Gly Val 
40 

Pro Asp 

Arg Gly 

Leu Val 

Gin Val 
105 
Glu Ala 
120 



Ser Leu 
10 

Pro Glu 

Arg Ala 

Ala Trp 

Gly Glu 
75 

Ala Asp 
90 

Ala He 
Phe Ala 



Leu He Pro 

Leu Lys Leu 
30 

Pro Thr Lys 
45 

Pro Thr Trp 
60 

Leu He Ala 

Gly Leu Leu 

He Ala Asp 
110 

Ala Gly Leu 
125 



Leu Thr 
15 

Glu Ser 

Ala Thr 

Pro Val 

Tyr Leu 
80 

Ala Lys 
95 

Val Asp 
Ala Pro 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1323 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/0 9/8 66,3 79 



DATE : 0 2/ 1 1/2 0 U I 
TIME: 09:07;Q7 



Input Set : A:\DIVER1370-7.ST25.txt 
Output Set: N:\CRF3\02112002\I866379.raw 



204 Asp Cys Ala He Thr Val His Thr Gin Ala Asp Thr Ser Ser Pro Asp 
-05 130 135 140 

208 Pro Leu Phe Asn Pro Leu Lys Thr Gly Val Cys Gin Leu Asp Asn Ala 

209 145 150 155 160 

212 Asn Val Thr Asp Ala He Leu Ser Arg Ala Gly Gly Ser He Ala Asp 

213 165 170 175 

216 Phe Thr Gly His Arg Gin Thr Ala Phe Arg Glu Leu Glu Arg Val Leu 

217 180 185 190 

220 Asn Phe Pro Gin Ser Asn Leu Cys Leu Lys Arg Glu Lys Gin Asp Glu 

221 195 200 " 205 

224 Ser Cys Ser Leu Thr Gin Ala Leu Pro Ser Glu Leu Lys Val Ser Ala 

225 210 215 220 

228 Asp Asn Val Ser Leu Thr Gly Ala Val Ser Leu Ala Ser Met Leu Thr 

229 225 230 235 240 

232 Glu He Phe Leu Leu Gin Gin Ala Gin Gly Met Pro Glu Pro Gly Trp 

233 245 250 255 

236 Gly Arg He Thr Asp Ser His Gin Trp Asn Thr Leu Leu Ser Leu His 

237 260 265 270 

240 Asn Ala Gin Phe Tyr Leu Leu Gin Arg Thr Pro Glu Val Ala Arg Ser 

241 275 280 285 

244 Arg Ala Thr Pro Leu Leu Asp Leu He Met Ala Ala Leu Thr Pro His 

245 290 295 300 

248 Pro Pro Gin Lys Gin Ala Tyr Gly Val Thr Leu Pro Thr Ser Val Leu 

249 305 310 315 320 

252 Phe He Ala Gly His Asp Thr Asn Leu Ala Asn Leu Gly Gly Ala Leu 

253 325 330 335 

256 Glu Leu Asn Trp Thr Leu Pro Gly Gin Pro Asp Asn Thr Pro Pro Gly 

257 340 345 350 

260 Gly Glu Leu Val Phe Glu Arg Trp Arg Arg Leu Ser Asp Asn Ser Gin 

261 355 360 365 

264 Trp He Gin Val Ser Leu Val Phe Gin Thr Leu Gin Gin Met Arg Asp 

265 370 375 380 

268 Lys Thr Pro Leu Ser Leu Asn Thr Pro Pro Gly Glu Val Lys Leu Thr 

269 385 390 395 400 

272 Leu Ala Gly Cys Glu Glu Arg Asn Ala Gin Gly Met Cys Ser Leu Ala 

273 405 410 415 

276 Gly Phe Thr Gin He Val Asn Glu Ala Arg He Pro Ala Cys Ser Leu 

277 420 425 430 

280 Arg Ser His His His His His His 

281 435 440 

284 <210> SEQ ID NO: 3 

285 <211> LENGTH: 49 

286 <212> TYPE: DNA 

287 <213> ORGANISM: Artificial Sequence 

289 <220> FEATURE: 

290 <223> OTHER INFORMATION: Primer for PCR 

292 <400> SEQUENCE: 3 

293 gtttctgaat tcaaggagga atttaaatga aagcgatctt aatcccatt 49 
296 <210> SEQ ID NO: 4 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/0 9/8 66,3 79 



DAT E : 02/1 1 / 2 0 0 2 
TIME: 09:07:07 



Input Set : A:\DIVER1370-7.ST25.txt 
Output Set: N:\CRF3\02112002\I866379.raw 



297 <211> LENGTH: 33 

298 <212> TYPE: DNA 

299 <213> ORGANISM: Artificial Sequence 

301 <J20> FEATURE: 

302 <223> OTHER INFORMATION: Primer for PCR 

304 <400> SEQUENCE: 4 

305 gtttctggat ccttacaaac tgcacgccgg tat 

308 <210> SEQ ID NO: 5 

309 <211> LENGTH: 1901 

310 <212> TYPE: DNA 

311 <213> ORGANISM: Escherichia coli 

313 <220> FEATURE: 

314 <221> NAME/KEY: misc_f eature 

315 <222> LOCATION: (1)..(1901) 

316 <223> OTHER INFORMATION: n is any nucleotide 

318 <400> SEQUENCE: 5 

319 taaggagcag aaacaatgtg gtatttactt tggttcgtcg gcattttgtt gatgtgttcg 
321 ctctccaccc ttgtgttggt atggctggac ccgcgtctga aaagttaacg aacgtaggcc 
323 tgatgcggcg cattagcatc gcatcaggca atcaataatg tcagatatga aaagcggaaa 
325 catatcgatg aaagcgatct taatcccatt tttatctctt ctgattccgt taaccccgca 
327 atctgcattc gctcagagtg agccggagct gaagctggaa agtgtggtga ttgtcagtcg 
329 tcatggtgtg cgtgctccaa ccaaggccac gcaactgatg caggatgtca ccccagacgc 
331 atggccaacc tggccggtaa aactgggttg actgacaccg cgnggtggtg agctaatcgc 
333 ctatctcgga cattaccaac gccagcgtct ggtagccgac ggattgctgg cgaaaaaggg 
335 ctgcccgcag tctggtcagg tcgcgattat tgctgatgtc gacgagcgta cccgtaaaac 
337 aggcgaagcc ttcgccgccg ggctggcacc tgactgtgca ataaccgtac atacccaggc 
339 agatacgtcc agtcccgatc cgttatttaa tcctctaaaa actggcgttt gccaactgga 
341 taacgcgaac gtgactgacg cgatcctcag cagggcagga gggtcaattg ctgactttac 
343 cgggcatcgg caaacggcgt ttcgcgaact ggaacgggtg cttaattttc cgcaatcaaa 
345 cttgtgcctt aaacgtgaga aacaggacga aagctgttca ttaacgcagg cattaccatc 
347 ggaactcaag gtgagcgccg acaatgtctc attaaccggt gcggtaagcc tcgcatcaat 
349 gctgacggag atatttctcc tgcaacaagc acagggaatg ccggagccgg ggtggggaag 
351 gatcaccgat tcacaccagt ggaacacctt gctaagtttg cataacgcgc aattttattt 
353 gctacaacgc acgccagagg ttgcccgcag ccgcgccacc ccgttattag atttgatcaa 
355 gacagcgttg acgccccatc caccgcaaaa acaggcgtat ggtgtgacat tacccacttc 
357 agtgctgttt atcgccggac acgatactaa tctggcaaat ctcggcggcg cactggagct 
359 caactggacg cttcccggtc agccggataa cacgccgcca ggtggtgaac tggtgtttga 
361 acgctggcgt cggctaagcg ataacagcca gtggattcag gtttcgctgg tcttccagac 
363 tttacagcag atgcgtgata aaacgccgct gtcattaaat acgccgcccg gagaggtgaa 
365 actgaccctg gcaggatgtg aagagcgaaa tgcgcagggc atgtgttcgt tggcaggttt 
367 tacgcaaatc gtgaatgaag cacgcatacc ggcgtgcagt ttgtaatgca taaaaaagag 
369 cattcagtta cctgaatgct ctgaggctga tgacaaacga agaactgtct aatgcgtaga 
371 ccggaaaagg cgttcacgcc gcatccggcc actttcagtt ttcctctttc tcggagtaac 
373 tataaccgta atagttatag ccgtaactgt aagcggtgct ggcgcgttta atcacaccat 
375 tgaggatagc gcctttaata ttgacgcctg cctgttccag acgctgcatt gacaaactca 
377 cctctttggc ggtgttcaag ccaaaacgcg caaccagcag gctggtgcca acagaacgcc 
379 ccacgaccgc ggcatcactc accgccagca tcggcggcgt atcgacaatc accagatcgt 
381 aatggtcgtt cgcccattcc agtaattgac gcatccgatc g 

384 <210> SEQ ID NO: 6 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1901 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/866 , 3 79 




Input Set : A:\DIVER1370-7.ST25.txt 
Output Set: N:\CRF3\02112002\I866379.raw 



L:68 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 1 
L:331 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 5 
L:407 M:341 W: (46) " n " or "Xaa" used, for SEQ ID# : 6 
L:483 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 7 
L:674 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 9 
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